Overview
Dataset statistics
| Number of variables | 18 |
|---|---|
| Number of observations | 2099836 |
| Missing cells | 549 |
| Missing cells (%) | < 0.1% |
| Duplicate rows | 0 |
| Duplicate rows (%) | 0.0% |
| Total size in memory | 1.6 GiB |
| Average record size in memory | 813.6 B |
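The percentage and size figures in this table follow from the raw counts. The sketch below redoes that arithmetic under two assumptions that the report does not state: the missing-cell percentage is taken over all cells (observations × variables), and 1 GiB is 2**30 bytes. The variable names are illustrative only.

```python
# Figures copied from the Dataset statistics table.
n_variables = 18
n_observations = 2_099_836
missing_cells = 549
avg_record_bytes = 813.6

# Assumption: the percentage is computed over every cell in the dataset.
total_cells = n_variables * n_observations
missing_pct = 100 * missing_cells / total_cells

# Assumption: 1 GiB = 2**30 bytes.
total_gib = avg_record_bytes * n_observations / 2**30

print(f"total cells:   {total_cells}")
print(f"missing cells: {missing_pct:.4f}%")
print(f"total size:    {total_gib:.2f} GiB")
```

Both results are consistent with the rounded values shown in the table (< 0.1% missing and 1.6 GiB).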
Variable types
| DateTime | 1 |
|---|---|
| Numeric | 7 |
| Text | 6 |
| Categorical | 4 |
Alerts
| Alert | Type |
|---|---|
| alineación con portafolio estratégico is highly overall correlated with valor | High correlation |
| cantidad is highly overall correlated with precio | High correlation |
| categoria is highly overall correlated with categoria_macro | High correlation |
| categoria_macro is highly overall correlated with categoria | High correlation |
| id is highly overall correlated with pedido | High correlation |
| pedido is highly overall correlated with id | High correlation |
| precio is highly overall correlated with cantidad | High correlation |
| valor is highly overall correlated with alineación con portafolio estratégico | High correlation |
| cantidad is highly skewed (γ1 = 377.7159296) | Skewed |
| precio is highly skewed (γ1 = 75.62012249) | Skewed |
| valor is highly skewed (γ1 = 86.3526775) | Skewed |
| alineación con portafolio estratégico is highly skewed (γ1 = -623.4425064) | Skewed |
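The γ1 values in the skew alerts are skewness coefficients. The sketch below shows the plain population form, g1 = m3 / m2^1.5, on hypothetical toy data. The profiling tool may apply a bias correction that this sketch omits, although at about 2.1 million rows the difference would be small. The cutoff `SKEW_THRESHOLD = 20` is an assumption for illustration and is not taken from this report.

```python
def skewness(values):
    """Population skewness g1 = m3 / m2**1.5 (no bias correction)."""
    n = len(values)
    mean = sum(values) / n
    m2 = sum((v - mean) ** 2 for v in values) / n  # second central moment
    m3 = sum((v - mean) ** 3 for v in values) / n  # third central moment
    return m3 / m2 ** 1.5


# Hypothetical toy data, not drawn from the profiled dataset.
symmetric = [1, 2, 3, 4, 5]
right_tailed = [1] * 999 + [1000]  # one extreme value in 1000 rows

SKEW_THRESHOLD = 20  # assumed alert cutoff, for illustration only

for name, data in [("symmetric", symmetric), ("right_tailed", right_tailed)]:
    g1 = skewness(data)
    flag = "highly skewed" if abs(g1) > SKEW_THRESHOLD else "ok"
    print(f"{name}: g1 = {g1:.2f} ({flag})")
```

A single extreme value among many identical ones is enough to push g1 past the assumed cutoff, which is one plausible reading of the very large coefficients reported for cantidad and alineación con portafolio estratégico.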
Reproduction
| Analysis started | 2025-03-22 00:39:03.776530 |
|---|---|
| Analysis finished | 2025-03-22 00:40:42.142737 |
| Duration | 1 minute and 38.37 seconds |
| Software version | ydata-profiling v4.12.2 |
| Download configuration | config.json |
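The Duration row can be checked against the two timestamps. A minimal sketch using only the standard library, with the timestamps copied from the table above:

```python
from datetime import datetime

FMT = "%Y-%m-%d %H:%M:%S.%f"
started = datetime.strptime("2025-03-22 00:39:03.776530", FMT)
finished = datetime.strptime("2025-03-22 00:40:42.142737", FMT)

# Elapsed wall-clock time in seconds, then split into minutes and seconds.
elapsed = (finished - started).total_seconds()
minutes, seconds = divmod(elapsed, 60)

print(f"{int(minutes)} minute(s) and {seconds:.2f} seconds")
```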
Variables
fecha
Date
| Distinct | 756 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 16.0 MiB |
| Minimum | 1971-01-02 00:00:00 |
|---|---|
| Maximum | 1973-01-31 00:00:00 |
| Invalid dates | 0 |
| Invalid dates (%) | 0.0% |
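The Distinct count can be compared with the calendar span between Minimum and Maximum. The sketch below assumes that fecha holds whole days (all times are 00:00:00, as the two bounds suggest). Under that assumption the range covers 761 calendar days, so 5 days in the range have no rows.

```python
from datetime import date

# Bounds and distinct count copied from the fecha tables.
first, last = date(1971, 1, 2), date(1973, 1, 31)
distinct_dates = 756

# Inclusive number of calendar days between the two bounds.
calendar_days = (last - first).days + 1

# Days inside the range that never appear in the column.
dates_without_rows = calendar_days - distinct_dates

print(calendar_days, dates_without_rows)
```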
pedido
Real number (ℝ)
High correlation 
| Distinct | 933935 |
|---|---|
| Distinct (%) | 44.5% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 467009.18 |
| Minimum | 2 |
|---|---|
| Maximum | 933936 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.0 MiB |
Quantile statistics
| Minimum | 2 |
|---|---|
| 5-th percentile | 46399 |
| Q1 | 231701 |
| median | 466824 |
| Q3 | 702678.25 |
| 95-th percentile | 887651 |
| Maximum | 933936 |
| Range | 933934 |
| Interquartile range (IQR) | 470977.25 |
Descriptive statistics
| Standard deviation | 270549.12 |
|---|---|
| Coefficient of variation (CV) | 0.57932291 |
| Kurtosis | -1.2073259 |
| Mean | 467009.18 |
| Median Absolute Deviation (MAD) | 235498 |
| Skewness | -0.0012157426 |
| Sum | 9.806427 × 10^11 |
| Variance | 7.3196826 × 10^10 |
| Monotonicity | Not monotonic |
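Several rows in the quantile and descriptive tables for pedido are derived from other rows. The sketch below recomputes them from the reported values as a consistency check. The formulas are the standard definitions (IQR = Q3 − Q1, range = max − min, CV = standard deviation / mean, variance = standard deviation squared), assumed here to match what the report uses.

```python
# Values copied from the pedido tables.
q1, q3 = 231701, 702678.25
minimum, maximum = 2, 933936
mean, std = 467009.18, 270549.12

iqr = q3 - q1                    # interquartile range
value_range = maximum - minimum  # range
cv = std / mean                  # coefficient of variation
variance = std ** 2              # variance from the standard deviation

print(f"IQR:      {iqr}")
print(f"Range:    {value_range}")
print(f"CV:       {cv:.8f}")
print(f"Variance: {variance:.7e}")
```

The recomputed values agree with the table to the precision shown there.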
| Value | Count | Frequency (%) |
| 771273 | 78 | < 0.1% |
| 57130 | 57 | < 0.1% |
| 662366 | 48 | < 0.1% |
| 478732 | 44 | < 0.1% |
| 173770 | 44 | < 0.1% |
| 632112 | 41 | < 0.1% |
| 65592 | 40 | < 0.1% |
| 145891 | 40 | < 0.1% |
| 805154 | 40 | < 0.1% |
| 558223 | 40 | < 0.1% |
| Other values (933925) | 2099364 |
| Value | Count | Frequency (%) |
| 2 | 1 | |
| 3 | 1 | |
| 4 | 1 | |
| 5 | 1 | |
| 6 | 2 | |
| 7 | 2 | |
| 8 | 1 | |
| 9 | 1 | |
| 10 | 1 | |
| 11 | 1 |
| Value | Count | Frequency (%) |
| 933936 | 2 | |
| 933935 | 1 | < 0.1% |
| 933934 | 1 | < 0.1% |
| 933933 | 3 | |
| 933932 | 1 | < 0.1% |
| 933931 | 1 | < 0.1% |
| 933930 | 1 | < 0.1% |
| 933929 | 2 | |
| 933928 | 2 | |
| 933927 | 4 |
id
Real number (ℝ)
High correlation 
| Distinct | 419226 |
|---|---|
| Distinct (%) | 20.0% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 165352 |
| Minimum | 1 |
|---|---|
| Maximum | 419226 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.0 MiB |
Quantile statistics
| Minimum | 1 |
|---|---|
| 5-th percentile | 8980 |
| Q1 | 58294 |
| median | 137973 |
| Q3 | 261744.25 |
| 95-th percentile | 384371.25 |
| Maximum | 419226 |
| Range | 419225 |
| Interquartile range (IQR) | 203450.25 |
Descriptive statistics
| Standard deviation | 121459.27 |
|---|---|
| Coefficient of variation (CV) | 0.73454977 |
| Kurtosis | -1.0263245 |
| Mean | 165352 |
| Median Absolute Deviation (MAD) | 94770 |
| Skewness | 0.45491189 |
| Sum | 3.4721208 × 10^11 |
| Variance | 1.4752355 × 10^10 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 3037 | 1743 | 0.1% |
| 1236 | 1682 | 0.1% |
| 2906 | 1565 | 0.1% |
| 3121 | 1366 | 0.1% |
| 3357 | 1321 | 0.1% |
| 2412 | 1173 | 0.1% |
| 30822 | 998 | < 0.1% |
| 4206 | 892 | < 0.1% |
| 512 | 650 | < 0.1% |
| 3038 | 461 | < 0.1% |
| Other values (419216) | 2087985 |
| Value | Count | Frequency (%) |
| 1 | 85 | |
| 2 | 18 | < 0.1% |
| 3 | 1 | < 0.1% |
| 4 | 1 | < 0.1% |
| 5 | 5 | < 0.1% |
| 6 | 36 | |
| 7 | 5 | < 0.1% |
| 8 | 9 | < 0.1% |
| 9 | 23 | < 0.1% |
| 10 | 5 | < 0.1% |
| Value | Count | Frequency (%) |
| 419226 | 2 | < 0.1% |
| 419225 | 1 | < 0.1% |
| 419224 | 1 | < 0.1% |
| 419223 | 1 | < 0.1% |
| 419222 | 1 | < 0.1% |
| 419221 | 11 | |
| 419220 | 8 | |
| 419219 | 1 | < 0.1% |
| 419218 | 6 | |
| 419217 | 2 | < 0.1% |
edad
Real number (ℝ)
| Distinct | 50 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 41.870732 |
| Minimum | 18 |
|---|---|
| Maximum | 67 |
| Zeros | 0 |
| Zeros (%) | 0.0% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.0 MiB |
Quantile statistics
| Minimum | 18 |
|---|---|
| 5-th percentile | 29 |
| Q1 | 32 |
| median | 43 |
| Q3 | 49 |
| 95-th percentile | 58 |
| Maximum | 67 |
| Range | 49 |
| Interquartile range (IQR) | 17 |
Descriptive statistics
| Standard deviation | 9.5630711 |
|---|---|
| Coefficient of variation (CV) | 0.22839513 |
| Kurtosis | -0.92834379 |
| Mean | 41.870732 |
| Median Absolute Deviation (MAD) | 8 |
| Skewness | 0.17185007 |
| Sum | 87921670 |
| Variance | 91.452329 |
| Monotonicity | Not monotonic |
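The Median Absolute Deviation (MAD) row is a robust measure of spread: the median of the absolute distances from the median. The sketch below uses the unscaled definition, which is an assumption about how the report computes it, and runs on a hypothetical handful of ages rather than the real column.

```python
from statistics import median


def median_absolute_deviation(values):
    """Unscaled MAD: median of |x - median(x)|."""
    values = list(values)
    center = median(values)
    return median(abs(v - center) for v in values)


# Hypothetical toy ages, not drawn from the profiled dataset.
ages = [18, 31, 31, 43, 49, 58, 67]

print(median(ages), median_absolute_deviation(ages))
```

Unlike the standard deviation, this statistic barely moves when a few extreme ages are added, which makes it a useful companion to the 9.56 standard deviation reported for edad.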
| Value | Count | Frequency (%) |
| 31 | 332758 | 15.8% |
| 43 | 110451 | 5.3% |
| 47 | 102710 | 4.9% |
| 41 | 98523 | 4.7% |
| 45 | 93176 | 4.4% |
| 50 | 78762 | 3.8% |
| 46 | 78428 | 3.7% |
| 30 | 74883 | 3.6% |
| 42 | 74559 | 3.6% |
| 40 | 70853 | 3.4% |
| Other values (40) | 984733 |
| Value | Count | Frequency (%) |
| 18 | 11 | < 0.1% |
| 19 | 240 | < 0.1% |
| 20 | 216 | < 0.1% |
| 21 | 569 | < 0.1% |
| 22 | 4143 | 0.2% |
| 23 | 7722 | |
| 24 | 5801 | 0.3% |
| 25 | 7831 | |
| 26 | 12352 | |
| 27 | 17506 |
| Value | Count | Frequency (%) |
| 67 | 294 | < 0.1% |
| 66 | 22 | < 0.1% |
| 65 | 90 | < 0.1% |
| 64 | 20375 | |
| 63 | 2055 | 0.1% |
| 62 | 6806 | 0.3% |
| 61 | 5553 | 0.3% |
| 60 | 30346 | |
| 59 | 25744 | |
| 58 | 30632 |
municipio
Text
| Distinct | 808 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 130.7 MiB |
Length
| Max length | 27 |
|---|---|
| Median length | 26 |
| Mean length | 8.257823 |
| Min length | 3 |
Unique
| Unique | 32 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | EL CARMEN DE CHUCURI |
|---|---|
| 2nd row | VILLANUEVA |
| 3rd row | VILLANUEVA |
| 4th row | VILLANUEVA |
| 5th row | ARROYOHONDO |
| Value | Count | Frequency (%) |
| curiti | 576380 | |
| natagaima | 344983 | 13.7% |
| villanueva | 142303 | 5.7% |
| guatica | 130383 | 5.2% |
| girardota | 76950 | 3.1% |
| de | 73799 | 2.9% |
| santa | 53719 | 2.1% |
| don | 41577 | 1.7% |
| matias | 41577 | 1.7% |
| la | 41484 | 1.7% |
| Other values (809) | 986380 |
Most occurring characters
| Value | Count | Frequency (%) |
| A | 3487137 | |
| I | 2314198 | |
| T | 1510427 | |
| R | 1254442 | 7.2% |
| U | 1172303 | 6.8% |
| C | 1074410 | 6.2% |
| N | 993913 | 5.7% |
| E | 756228 | 4.4% |
| G | 702421 | 4.1% |
| O | 647547 | 3.7% |
| Other values (28) | 3427048 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 16929904 | |
| Space Separator | 409699 | 2.4% |
| Other Punctuation | 276 | < 0.1% |
| Lowercase Letter | 164 | < 0.1% |
| Decimal Number | 21 | < 0.1% |
| Open Punctuation | 5 | < 0.1% |
| Close Punctuation | 5 | < 0.1% |
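The category names in this table (Uppercase Letter, Space Separator, and so on) correspond to Unicode general categories. The sketch below counts them with the standard-library `unicodedata` module over the five municipio sample rows shown earlier. The mapping from two-letter codes to display names is written out by hand here, and whether the profiling tool derives its counts the same way is an assumption.

```python
import unicodedata
from collections import Counter

# Display names for the general-category codes that appear in this report.
CATEGORY_NAMES = {
    "Lu": "Uppercase Letter",
    "Ll": "Lowercase Letter",
    "Nd": "Decimal Number",
    "Zs": "Space Separator",
    "Po": "Other Punctuation",
    "Ps": "Open Punctuation",
    "Pe": "Close Punctuation",
}

# The five sample rows listed for municipio.
samples = [
    "EL CARMEN DE CHUCURI",
    "VILLANUEVA",
    "VILLANUEVA",
    "VILLANUEVA",
    "ARROYOHONDO",
]

# Count one general-category code per character across all sample values.
counts = Counter(unicodedata.category(ch) for value in samples for ch in value)

for code, count in counts.most_common():
    print(f"{CATEGORY_NAMES.get(code, code)}: {count}")
```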
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 3487137 | |
| I | 2314198 | |
| T | 1510427 | |
| R | 1254442 | 7.4% |
| U | 1172303 | 6.9% |
| C | 1074410 | 6.3% |
| N | 993913 | 5.9% |
| E | 756228 | 4.5% |
| G | 702421 | 4.1% |
| O | 647547 | 3.8% |
| Other values (14) | 3016878 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 41 | |
| d | 36 | |
| o | 23 | |
| c | 23 | |
| l | 18 | |
| i | 18 | |
| í | 5 | 3.0% |
Decimal Number
| Value | Count | Frequency (%) |
| 7 | 7 | |
| 3 | 7 | |
| 6 | 7 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 409699 |
Other Punctuation
| Value | Count | Frequency (%) |
| . | 276 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16930068 | |
| Common | 410006 | 2.4% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 3487137 | |
| I | 2314198 | |
| T | 1510427 | |
| R | 1254442 | 7.4% |
| U | 1172303 | 6.9% |
| C | 1074410 | 6.3% |
| N | 993913 | 5.9% |
| E | 756228 | 4.5% |
| G | 702421 | 4.1% |
| O | 647547 | 3.8% |
| Other values (21) | 3017042 |
Common
| Value | Count | Frequency (%) |
| (space) | 409699 | |
| . | 276 | 0.1% |
| 7 | 7 | < 0.1% |
| 3 | 7 | < 0.1% |
| 6 | 7 | < 0.1% |
| ( | 5 | < 0.1% |
| ) | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17340069 | |
| None | 5 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 3487137 | |
| I | 2314198 | |
| T | 1510427 | |
| R | 1254442 | 7.2% |
| U | 1172303 | 6.8% |
| C | 1074410 | 6.2% |
| N | 993913 | 5.7% |
| E | 756228 | 4.4% |
| G | 702421 | 4.1% |
| O | 647547 | 3.7% |
| Other values (27) | 3427043 |
None
| Value | Count | Frequency (%) |
| í | 5 |
zona
Categorical
| Distinct | 34 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 18 |
| Missing (%) | < 0.1% |
| Memory size | 133.2 MiB |
Length
| Max length | 15 |
|---|---|
| Median length | 9 |
| Mean length | 8.402673 |
| Min length | 4 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | SANTANDER |
|---|---|
| 2nd row | LA GUAJIRA |
| 3rd row | LA GUAJIRA |
| 4th row | LA GUAJIRA |
| 5th row | BOLÍVAR |
Common Values
| Value | Count | Frequency (%) |
| SANTANDER | 739964 | |
| TOLIMA | 407991 | |
| ANTIOQUIA | 224114 | 10.7% |
| LA GUAJIRA | 145807 | 6.9% |
| RISARALDA | 132904 | 6.3% |
| CUNDINAMARCA | 102305 | 4.9% |
| NORTE SANTANDER | 59017 | 2.8% |
| BOYACA | 45454 | 2.2% |
| HUILA | 36290 | 1.7% |
| META | 32602 | 1.6% |
| Other values (24) | 173370 | 8.3% |
Words
| Value | Count | Frequency (%) |
| santander | 798981 | |
| tolima | 407991 | |
| antioquia | 224114 | 9.7% |
| la | 145807 | 6.3% |
| guajira | 145807 | 6.3% |
| risaralda | 132904 | 5.8% |
| cundinamarca | 102305 | 4.4% |
| norte | 59017 | 2.6% |
| boyaca | 45454 | 2.0% |
| huila | 36290 | 1.6% |
| Other values (26) | 206089 | 8.9% |
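The word-level counts in the table above differ from the value-level counts under Common Values because multi-word values contribute to several words. The sketch below reproduces that for a few zona values, assuming lowercase, whitespace-separated tokenization (an assumption about the tool's behavior). The counts are copied from the Common Values table.

```python
from collections import Counter

# Value-level counts copied from the Common Values table for zona.
value_counts = {
    "SANTANDER": 739964,
    "TOLIMA": 407991,
    "LA GUAJIRA": 145807,
    "NORTE SANTANDER": 59017,
}

# Assumption: words are obtained by lowercasing and splitting on whitespace.
word_counts = Counter()
for value, count in value_counts.items():
    for word in value.lower().split():
        word_counts[word] += count

for word, count in word_counts.most_common():
    print(word, count)
```

Under this assumption "santander" totals 798981, the sum of the SANTANDER and NORTE SANTANDER rows, which matches the first row of the word table.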
Most occurring characters
| Value | Count | Frequency (%) |
| A | 4008766 | |
| N | 2169725 | |
| T | 1583306 | 9.0% |
| R | 1447066 | 8.2% |
| I | 1330627 | 7.5% |
| D | 1054626 | 6.0% |
| S | 972829 | 5.5% |
| E | 956562 | 5.4% |
| L | 837651 | 4.7% |
| O | 822694 | 4.7% |
| Other values (24) | 2460232 |
Most occurring categories
| Value | Count | Frequency (%) |
| Uppercase Letter | 17439113 | |
| Space Separator | 204941 | 1.2% |
| Lowercase Letter | 20 | < 0.1% |
| Open Punctuation | 5 | < 0.1% |
| Close Punctuation | 5 | < 0.1% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| A | 4008766 | |
| N | 2169725 | |
| T | 1583306 | 9.1% |
| R | 1447066 | 8.3% |
| I | 1330627 | 7.6% |
| D | 1054626 | 6.0% |
| S | 972829 | 5.6% |
| E | 956562 | 5.5% |
| L | 837651 | 4.8% |
| O | 822694 | 4.7% |
| Other values (17) | 2255261 |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 5 | |
| c | 5 | |
| í | 5 | |
| o | 5 |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 204941 |
Open Punctuation
| Value | Count | Frequency (%) |
| ( | 5 |
Close Punctuation
| Value | Count | Frequency (%) |
| ) | 5 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17439133 | |
| Common | 204951 | 1.2% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| A | 4008766 | |
| N | 2169725 | |
| T | 1583306 | 9.1% |
| R | 1447066 | 8.3% |
| I | 1330627 | 7.6% |
| D | 1054626 | 6.0% |
| S | 972829 | 5.6% |
| E | 956562 | 5.5% |
| L | 837651 | 4.8% |
| O | 822694 | 4.7% |
| Other values (21) | 2255281 |
Common
| Value | Count | Frequency (%) |
| (space) | 204941 | |
| ( | 5 | < 0.1% |
| ) | 5 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 17573099 | |
| None | 70985 | 0.4% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| A | 4008766 | |
| N | 2169725 | |
| T | 1583306 | 9.0% |
| R | 1447066 | 8.2% |
| I | 1330627 | 7.6% |
| D | 1054626 | 6.0% |
| S | 972829 | 5.5% |
| E | 956562 | 5.4% |
| L | 837651 | 4.8% |
| O | 822694 | 4.7% |
| Other values (18) | 2389247 |
None
| Value | Count | Frequency (%) |
| Á | 29632 | |
| Ñ | 19021 | |
| Í | 17924 | |
| Ó | 4286 | 6.0% |
| É | 117 | 0.2% |
| í | 5 | < 0.1% |
asesor
Text
| Distinct | 608 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 133.4 MiB |
Length
| Max length | 10 |
|---|---|
| Median length | 10 |
| Mean length | 9.6363716 |
| Min length | 8 |
Unique
| Unique | 19 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | asesor_2 |
|---|---|
| 2nd row | asesor_3 |
| 3rd row | asesor_4 |
| 4th row | asesor_5 |
| 5th row | asesor_6 |
| Value | Count | Frequency (%) |
| asesor_137 | 14230 | 0.7% |
| asesor_7 | 14230 | 0.7% |
| asesor_45 | 13216 | 0.6% |
| asesor_256 | 13000 | 0.6% |
| asesor_170 | 12804 | 0.6% |
| asesor_165 | 12379 | 0.6% |
| asesor_13 | 12254 | 0.6% |
| asesor_139 | 12078 | 0.6% |
| asesor_149 | 11977 | 0.6% |
| asesor_6 | 11877 | 0.6% |
| Other values (598) | 1971791 |
Most occurring characters
| Value | Count | Frequency (%) |
| s | 4199672 | |
| a | 2099836 | |
| e | 2099836 | |
| o | 2099836 | |
| r | 2099836 | |
| _ | 2099836 | |
| 1 | 1121748 | 5.5% |
| 2 | 871054 | 4.3% |
| 3 | 639767 | 3.2% |
| 4 | 491297 | 2.4% |
| Other values (6) | 2412082 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 12599016 | |
| Decimal Number | 5535948 | |
| Connector Punctuation | 2099836 | 10.4% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1121748 | |
| 2 | 871054 | |
| 3 | 639767 | |
| 4 | 491297 | |
| 7 | 458887 | |
| 5 | 439879 | 7.9% |
| 6 | 417157 | 7.5% |
| 8 | 404211 | 7.3% |
| 9 | 376879 | 6.8% |
| 0 | 315069 | 5.7% |
Lowercase Letter
| Value | Count | Frequency (%) |
| s | 4199672 | |
| a | 2099836 | |
| e | 2099836 | |
| o | 2099836 | |
| r | 2099836 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2099836 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 12599016 | |
| Common | 7635784 |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| _ | 2099836 | |
| 1 | 1121748 | |
| 2 | 871054 | |
| 3 | 639767 | 8.4% |
| 4 | 491297 | 6.4% |
| 7 | 458887 | 6.0% |
| 5 | 439879 | 5.8% |
| 6 | 417157 | 5.5% |
| 8 | 404211 | 5.3% |
| 9 | 376879 | 4.9% |
Latin
| Value | Count | Frequency (%) |
| s | 4199672 | |
| a | 2099836 | |
| e | 2099836 | |
| o | 2099836 | |
| r | 2099836 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 20234800 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| s | 4199672 | |
| a | 2099836 | |
| e | 2099836 | |
| o | 2099836 | |
| r | 2099836 | |
| _ | 2099836 | |
| 1 | 1121748 | 5.5% |
| 2 | 871054 | 4.3% |
| 3 | 639767 | 3.2% |
| 4 | 491297 | 2.4% |
| Other values (6) | 2412082 |
punto de venta
Text
| Distinct | 66 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 141.4 MiB |
Length
| Max length | 14 |
|---|---|
| Median length | 14 |
| Mean length | 13.631986 |
| Min length | 13 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | punto_venta_2 |
|---|---|
| 2nd row | punto_venta_2 |
| 3rd row | punto_venta_2 |
| 4th row | punto_venta_3 |
| 5th row | punto_venta_4 |
| Value | Count | Frequency (%) |
| punto_venta_7 | 132386 | 6.3% |
| punto_venta_4 | 123307 | 5.9% |
| punto_venta_2 | 118419 | 5.6% |
| punto_venta_6 | 116560 | 5.6% |
| punto_venta_10 | 106348 | 5.1% |
| punto_venta_9 | 101083 | 4.8% |
| punto_venta_21 | 97357 | 4.6% |
| punto_venta_1 | 73464 | 3.5% |
| punto_venta_34 | 58383 | 2.8% |
| punto_venta_20 | 56943 | 2.7% |
| Other values (56) | 1115586 |
Most occurring characters
| Value | Count | Frequency (%) |
| n | 4199672 | |
| t | 4199672 | |
| _ | 4199672 | |
| p | 2099836 | |
| o | 2099836 | |
| v | 2099836 | |
| e | 2099836 | |
| a | 2099836 | |
| u | 2099836 | |
| 2 | 735466 | 2.6% |
| Other values (9) | 2691437 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 20998360 | |
| Connector Punctuation | 4199672 | 14.7% |
| Decimal Number | 3426903 | 12.0% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 735466 | |
| 1 | 674433 | |
| 3 | 550932 | |
| 4 | 299012 | |
| 7 | 261590 | 7.6% |
| 0 | 207129 | 6.0% |
| 6 | 203344 | 5.9% |
| 5 | 191679 | 5.6% |
| 9 | 168879 | 4.9% |
| 8 | 134439 | 3.9% |
Lowercase Letter
| Value | Count | Frequency (%) |
| n | 4199672 | |
| t | 4199672 | |
| p | 2099836 | |
| o | 2099836 | |
| v | 2099836 | |
| e | 2099836 | |
| a | 2099836 | |
| u | 2099836 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4199672 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 20998360 | |
| Common | 7626575 | 26.6% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| _ | 4199672 | |
| 2 | 735466 | 9.6% |
| 1 | 674433 | 8.8% |
| 3 | 550932 | 7.2% |
| 4 | 299012 | 3.9% |
| 7 | 261590 | 3.4% |
| 0 | 207129 | 2.7% |
| 6 | 203344 | 2.7% |
| 5 | 191679 | 2.5% |
| 9 | 168879 | 2.2% |
Latin
| Value | Count | Frequency (%) |
| n | 4199672 | |
| t | 4199672 | |
| p | 2099836 | |
| o | 2099836 | |
| v | 2099836 | |
| e | 2099836 | |
| a | 2099836 | |
| u | 2099836 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 28624935 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| n | 4199672 | |
| t | 4199672 | |
| _ | 4199672 | |
| p | 2099836 | |
| o | 2099836 | |
| v | 2099836 | |
| e | 2099836 | |
| a | 2099836 | |
| u | 2099836 | |
| 2 | 735466 | 2.6% |
| Other values (9) | 2691437 |
cluster
Categorical
| Distinct | 9 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 146.2 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 16 |
| Mean length | 16 |
| Min length | 16 |
Unique
| Unique | 1 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | cluster_tienda_2 |
|---|---|
| 2nd row | cluster_tienda_2 |
| 3rd row | cluster_tienda_2 |
| 4th row | cluster_tienda_3 |
| 5th row | cluster_tienda_2 |
Common Values
| Value | Count | Frequency (%) |
| cluster_tienda_3 | 926405 | |
| cluster_tienda_2 | 839005 | |
| cluster_tienda_1 | 207099 | 9.9% |
| cluster_tienda_4 | 98458 | 4.7% |
| cluster_tienda_5 | 26109 | 1.2% |
| cluster_tienda_6 | 1216 | 0.1% |
| cluster_tienda_8 | 791 | < 0.1% |
| cluster_tienda_7 | 752 | < 0.1% |
| cluster_tienda_9 | 1 | < 0.1% |
Words
| Value | Count | Frequency (%) |
| cluster_tienda_3 | 926405 | |
| cluster_tienda_2 | 839005 | |
| cluster_tienda_1 | 207099 | 9.9% |
| cluster_tienda_4 | 98458 | 4.7% |
| cluster_tienda_5 | 26109 | 1.2% |
| cluster_tienda_6 | 1216 | 0.1% |
| cluster_tienda_8 | 791 | < 0.1% |
| cluster_tienda_7 | 752 | < 0.1% |
| cluster_tienda_9 | 1 | < 0.1% |
Most occurring characters
| Value | Count | Frequency (%) |
| t | 4199672 | |
| e | 4199672 | |
| _ | 4199672 | |
| c | 2099836 | 6.2% |
| i | 2099836 | 6.2% |
| a | 2099836 | 6.2% |
| l | 2099836 | 6.2% |
| n | 2099836 | 6.2% |
| d | 2099836 | 6.2% |
| r | 2099836 | 6.2% |
| Other values (11) | 6299508 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 27297868 | |
| Connector Punctuation | 4199672 | 12.5% |
| Decimal Number | 2099836 | 6.2% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| t | 4199672 | |
| e | 4199672 | |
| c | 2099836 | |
| i | 2099836 | |
| a | 2099836 | |
| l | 2099836 | |
| n | 2099836 | |
| d | 2099836 | |
| r | 2099836 | |
| s | 2099836 |
Decimal Number
| Value | Count | Frequency (%) |
| 3 | 926405 | |
| 2 | 839005 | |
| 1 | 207099 | 9.9% |
| 4 | 98458 | 4.7% |
| 5 | 26109 | 1.2% |
| 6 | 1216 | 0.1% |
| 8 | 791 | < 0.1% |
| 7 | 752 | < 0.1% |
| 9 | 1 | < 0.1% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4199672 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 27297868 | |
| Common | 6299508 | 18.8% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| t | 4199672 | |
| e | 4199672 | |
| c | 2099836 | |
| i | 2099836 | |
| a | 2099836 | |
| l | 2099836 | |
| n | 2099836 | |
| d | 2099836 | |
| r | 2099836 | |
| s | 2099836 |
Common
| Value | Count | Frequency (%) |
| _ | 4199672 | |
| 3 | 926405 | 14.7% |
| 2 | 839005 | 13.3% |
| 1 | 207099 | 3.3% |
| 4 | 98458 | 1.6% |
| 5 | 26109 | 0.4% |
| 6 | 1216 | < 0.1% |
| 8 | 791 | < 0.1% |
| 7 | 752 | < 0.1% |
| 9 | 1 | < 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 33597376 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| t | 4199672 | |
| e | 4199672 | |
| _ | 4199672 | |
| c | 2099836 | 6.2% |
| i | 2099836 | 6.2% |
| a | 2099836 | 6.2% |
| l | 2099836 | 6.2% |
| n | 2099836 | 6.2% |
| d | 2099836 | 6.2% |
| r | 2099836 | 6.2% |
| Other values (11) | 6299508 |
categoria_macro
Categorical
High correlation 
| Distinct | 5 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 148.2 MiB |
Length
| Max length | 17 |
|---|---|
| Median length | 17 |
| Mean length | 17 |
| Min length | 17 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | categoria_macro_1 |
|---|---|
| 2nd row | categoria_macro_2 |
| 3rd row | categoria_macro_3 |
| 4th row | categoria_macro_2 |
| 5th row | categoria_macro_4 |
Common Values
| Value | Count | Frequency (%) |
| categoria_macro_2 | 1304967 | |
| categoria_macro_4 | 502419 | 23.9% |
| categoria_macro_1 | 187124 | 8.9% |
| categoria_macro_3 | 96038 | 4.6% |
| categoria_macro_5 | 9288 | 0.4% |
Words
| Value | Count | Frequency (%) |
| categoria_macro_2 | 1304967 | |
| categoria_macro_4 | 502419 | 23.9% |
| categoria_macro_1 | 187124 | 8.9% |
| categoria_macro_3 | 96038 | 4.6% |
| categoria_macro_5 | 9288 | 0.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 6299508 | |
| c | 4199672 | |
| o | 4199672 | |
| r | 4199672 | |
| _ | 4199672 | |
| t | 2099836 | 5.9% |
| e | 2099836 | 5.9% |
| g | 2099836 | 5.9% |
| i | 2099836 | 5.9% |
| m | 2099836 | 5.9% |
| Other values (5) | 2099836 | 5.9% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 29397704 | |
| Connector Punctuation | 4199672 | 11.8% |
| Decimal Number | 2099836 | 5.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 6299508 | |
| c | 4199672 | |
| o | 4199672 | |
| r | 4199672 | |
| t | 2099836 | 7.1% |
| e | 2099836 | 7.1% |
| g | 2099836 | 7.1% |
| i | 2099836 | 7.1% |
| m | 2099836 | 7.1% |
Decimal Number
| Value | Count | Frequency (%) |
| 2 | 1304967 | |
| 4 | 502419 | 23.9% |
| 1 | 187124 | 8.9% |
| 3 | 96038 | 4.6% |
| 5 | 9288 | 0.4% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 4199672 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 29397704 | |
| Common | 6299508 | 17.6% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 6299508 | |
| c | 4199672 | |
| o | 4199672 | |
| r | 4199672 | |
| t | 2099836 | 7.1% |
| e | 2099836 | 7.1% |
| g | 2099836 | 7.1% |
| i | 2099836 | 7.1% |
| m | 2099836 | 7.1% |
Common
| Value | Count | Frequency (%) |
| _ | 4199672 | |
| 2 | 1304967 | 20.7% |
| 4 | 502419 | 8.0% |
| 1 | 187124 | 3.0% |
| 3 | 96038 | 1.5% |
| 5 | 9288 | 0.1% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 35697212 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 6299508 | |
| c | 4199672 | |
| o | 4199672 | |
| r | 4199672 | |
| _ | 4199672 | |
| t | 2099836 | 5.9% |
| e | 2099836 | 5.9% |
| g | 2099836 | 5.9% |
| i | 2099836 | 5.9% |
| m | 2099836 | 5.9% |
| Other values (5) | 2099836 | 5.9% |
categoria
Categorical
High correlation 
| Distinct | 27 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 136.6 MiB |
Length
| Max length | 12 |
|---|---|
| Median length | 11 |
| Mean length | 11.213215 |
| Min length | 11 |
Unique
| Unique | 0 |
|---|---|
| Unique (%) | 0.0% |
Sample
| 1st row | categoria_2 |
|---|---|
| 2nd row | categoria_3 |
| 3rd row | categoria_4 |
| 4th row | categoria_5 |
| 5th row | categoria_6 |
Common Values
| Value | Count | Frequency (%) |
| categoria_3 | 459528 | |
| categoria_7 | 435661 | |
| categoria_5 | 288953 | |
| categoria_11 | 156428 | 7.4% |
| categoria_1 | 149293 | 7.1% |
| categoria_12 | 132750 | 6.3% |
| categoria_8 | 119130 | 5.7% |
| categoria_9 | 80231 | 3.8% |
| categoria_10 | 53115 | 2.5% |
| categoria_6 | 49806 | 2.4% |
| Other values (17) | 174941 | 8.3% |
Words
| Value | Count | Frequency (%) |
| categoria_3 | 459528 | |
| categoria_7 | 435661 | |
| categoria_5 | 288953 | |
| categoria_11 | 156428 | 7.4% |
| categoria_1 | 149293 | 7.1% |
| categoria_12 | 132750 | 6.3% |
| categoria_8 | 119130 | 5.7% |
| categoria_9 | 80231 | 3.8% |
| categoria_10 | 53115 | 2.5% |
| categoria_6 | 49806 | 2.4% |
| Other values (17) | 174941 | 8.3% |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4199672 | |
| c | 2099836 | |
| t | 2099836 | |
| e | 2099836 | |
| g | 2099836 | |
| o | 2099836 | |
| r | 2099836 | |
| i | 2099836 | |
| _ | 2099836 | |
| 1 | 733652 | 3.1% |
| Other values (9) | 1813900 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 18898524 | |
| Decimal Number | 2547552 | 10.8% |
| Connector Punctuation | 2099836 | 8.9% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 733652 | |
| 3 | 473505 | |
| 7 | 440274 | |
| 5 | 297406 | |
| 2 | 211316 | 8.3% |
| 8 | 119549 | 4.7% |
| 6 | 92463 | 3.6% |
| 9 | 80905 | 3.2% |
| 0 | 60335 | 2.4% |
| 4 | 38147 | 1.5% |
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4199672 | |
| c | 2099836 | |
| t | 2099836 | |
| e | 2099836 | |
| g | 2099836 | |
| o | 2099836 | |
| r | 2099836 | |
| i | 2099836 |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2099836 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 18898524 | |
| Common | 4647388 | 19.7% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| _ | 2099836 | |
| 1 | 733652 | 15.8% |
| 3 | 473505 | 10.2% |
| 7 | 440274 | 9.5% |
| 5 | 297406 | 6.4% |
| 2 | 211316 | 4.5% |
| 8 | 119549 | 2.6% |
| 6 | 92463 | 2.0% |
| 9 | 80905 | 1.7% |
| 0 | 60335 | 1.3% |
Latin
| Value | Count | Frequency (%) |
| a | 4199672 | |
| c | 2099836 | |
| t | 2099836 | |
| e | 2099836 | |
| g | 2099836 | |
| o | 2099836 | |
| r | 2099836 | |
| i | 2099836 |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 23545912 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4199672 | |
| c | 2099836 | |
| t | 2099836 | |
| e | 2099836 | |
| g | 2099836 | |
| o | 2099836 | |
| r | 2099836 | |
| i | 2099836 | |
| _ | 2099836 | |
| 1 | 733652 | 3.1% |
| Other values (9) | 1813900 |
subcategoria
Text
| Distinct | 102 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 143.0 MiB |
Length
| Max length | 16 |
|---|---|
| Median length | 14 |
| Mean length | 14.432911 |
| Min length | 14 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | subcategoria_2 |
|---|---|
| 2nd row | subcategoria_3 |
| 3rd row | subcategoria_4 |
| 4th row | subcategoria_5 |
| 5th row | subcategoria_6 |
| Value | Count | Frequency (%) |
| subcategoria_5 | 663877 | |
| subcategoria_3 | 241198 | 11.5% |
| subcategoria_9 | 151660 | 7.2% |
| subcategoria_12 | 74603 | 3.6% |
| subcategoria_14 | 73388 | 3.5% |
| subcategoria_22 | 66665 | 3.2% |
| subcategoria_7 | 61671 | 2.9% |
| subcategoria_39 | 52424 | 2.5% |
| subcategoria_13 | 50023 | 2.4% |
| subcategoria_24 | 46695 | 2.2% |
| Other values (92) | 617632 |
Most occurring characters
| Value | Count | Frequency (%) |
| a | 4199672 | |
| s | 2099836 | 6.9% |
| g | 2099836 | 6.9% |
| u | 2099836 | 6.9% |
| i | 2099836 | 6.9% |
| r | 2099836 | 6.9% |
| o | 2099836 | 6.9% |
| _ | 2099836 | 6.9% |
| e | 2099836 | 6.9% |
| t | 2099836 | 6.9% |
| Other values (12) | 7208551 |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 25198032 | |
| Decimal Number | 3008879 | 9.9% |
| Connector Punctuation | 2099836 | 6.9% |
Most frequent character per category
Lowercase Letter
| Value | Count | Frequency (%) |
| a | 4199672 | |
| s | 2099836 | |
| g | 2099836 | |
| u | 2099836 | |
| i | 2099836 | |
| r | 2099836 | |
| o | 2099836 | |
| e | 2099836 | |
| t | 2099836 | |
| c | 2099836 |
Decimal Number
| Value | Count | Frequency (%) |
| 5 | 776736 | |
| 3 | 479481 | |
| 2 | 458167 | |
| 1 | 416572 | |
| 4 | 267289 | 8.9% |
| 9 | 227900 | 7.6% |
| 7 | 144637 | 4.8% |
| 0 | 91787 | 3.1% |
| 6 | 85964 | 2.9% |
| 8 | 60346 | 2.0% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2099836 |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 25198032 | |
| Common | 5108715 | 16.9% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| a | 4199672 | |
| s | 2099836 | |
| g | 2099836 | |
| u | 2099836 | |
| i | 2099836 | |
| r | 2099836 | |
| o | 2099836 | |
| e | 2099836 | |
| t | 2099836 | |
| c | 2099836 |
Common
| Value | Count | Frequency (%) |
| _ | 2099836 | |
| 5 | 776736 | 15.2% |
| 3 | 479481 | 9.4% |
| 2 | 458167 | 9.0% |
| 1 | 416572 | 8.2% |
| 4 | 267289 | 5.2% |
| 9 | 227900 | 4.5% |
| 7 | 144637 | 2.8% |
| 0 | 91787 | 1.8% |
| 6 | 85964 | 1.7% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 30306747 |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| a | 4199672 | |
| s | 2099836 | 6.9% |
| g | 2099836 | 6.9% |
| u | 2099836 | 6.9% |
| i | 2099836 | 6.9% |
| r | 2099836 | 6.9% |
| o | 2099836 | 6.9% |
| _ | 2099836 | 6.9% |
| e | 2099836 | 6.9% |
| t | 2099836 | 6.9% |
| Other values (12) | 7208551 |
producto
Text
| Distinct | 7280 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 138.1 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 12 |
| Mean length | 11.978037 |
| Min length | 10 |
Unique
| Unique | 1036 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | producto_2 |
|---|---|
| 2nd row | producto_3 |
| 3rd row | producto_4 |
| 4th row | producto_5 |
| 5th row | producto_6 |
| Value | Count | Frequency (%) |
| producto_49 | 53825 | 2.6% |
| producto_19 | 49097 | 2.3% |
| producto_72 | 26394 | 1.3% |
| producto_176 | 25418 | 1.2% |
| producto_40 | 24688 | 1.2% |
| producto_3 | 23448 | 1.1% |
| producto_119 | 21339 | 1.0% |
| producto_28 | 20676 | 1.0% |
| producto_110 | 20628 | 1.0% |
| producto_67 | 20391 | 1.0% |
| Other values (7270) | 1813932 | 86.4% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 4199672 | 16.7% |
| p | 2099836 | 8.3% |
| d | 2099836 | 8.3% |
| u | 2099836 | 8.3% |
| c | 2099836 | 8.3% |
| t | 2099836 | 8.3% |
| _ | 2099836 | 8.3% |
| r | 2099836 | 8.3% |
| 1 | 1117368 | 4.4% |
| 2 | 753269 | 3.0% |
| Other values (8) | 4382752 | 17.4% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 16798688 | 66.8% |
| Decimal Number | 6253389 | 24.9% |
| Connector Punctuation | 2099836 | 8.3% |
Most frequent character per category
Decimal Number
| Value | Count | Frequency (%) |
| 1 | 1117368 | 17.9% |
| 2 | 753269 | 12.0% |
| 3 | 732386 | 11.7% |
| 4 | 629896 | 10.1% |
| 7 | 543362 | 8.7% |
| 6 | 541610 | 8.7% |
| 9 | 537752 | 8.6% |
| 5 | 493316 | 7.9% |
| 8 | 461200 | 7.4% |
| 0 | 443230 | 7.1% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 4199672 | 25.0% |
| p | 2099836 | 12.5% |
| d | 2099836 | 12.5% |
| u | 2099836 | 12.5% |
| c | 2099836 | 12.5% |
| t | 2099836 | 12.5% |
| r | 2099836 | 12.5% |
Connector Punctuation
| Value | Count | Frequency (%) |
| _ | 2099836 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 16798688 | 66.8% |
| Common | 8353225 | 33.2% |
Most frequent character per script
Common
| Value | Count | Frequency (%) |
| _ | 2099836 | 25.1% |
| 1 | 1117368 | 13.4% |
| 2 | 753269 | 9.0% |
| 3 | 732386 | 8.8% |
| 4 | 629896 | 7.5% |
| 7 | 543362 | 6.5% |
| 6 | 541610 | 6.5% |
| 9 | 537752 | 6.4% |
| 5 | 493316 | 5.9% |
| 8 | 461200 | 5.5% |
Latin
| Value | Count | Frequency (%) |
| o | 4199672 | 25.0% |
| p | 2099836 | 12.5% |
| d | 2099836 | 12.5% |
| u | 2099836 | 12.5% |
| c | 2099836 | 12.5% |
| t | 2099836 | 12.5% |
| r | 2099836 | 12.5% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 25151913 | 100.0% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 4199672 | 16.7% |
| p | 2099836 | 8.3% |
| d | 2099836 | 8.3% |
| u | 2099836 | 8.3% |
| c | 2099836 | 8.3% |
| t | 2099836 | 8.3% |
| _ | 2099836 | 8.3% |
| r | 2099836 | 8.3% |
| 1 | 1117368 | 4.4% |
| 2 | 753269 | 3.0% |
| Other values (8) | 4382752 | 17.4% |
color
Text
| Distinct | 69 |
|---|---|
| Distinct (%) | < 0.1% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Memory size | 132.0 MiB |
Length
| Max length | 13 |
|---|---|
| Median length | 12 |
| Mean length | 8.9198956 |
| Min length | 3 |
Unique
| Unique | 3 |
|---|---|
| Unique (%) | < 0.1% |
Sample
| 1st row | GRIS |
|---|---|
| 2nd row | BEIGE |
| 3rd row | No encontrado |
| 4th row | BLANCO |
| 5th row | No encontrado |
| Value | Count | Frequency (%) |
| no | 984853 | 31.9% |
| encontrado | 984853 | 31.9% |
| gris | 424960 | 13.8% |
| blanco | 242752 | 7.9% |
| beige | 172110 | 5.6% |
| multicolor | 89249 | 2.9% |
| marfil | 43116 | 1.4% |
| negro | 41193 | 1.3% |
| azul | 28952 | 0.9% |
| mate | 14583 | 0.5% |
| Other values (60) | 58068 | 1.9% |
Most occurring characters
| Value | Count | Frequency (%) |
| o | 2954559 | 15.8% |
| n | 1969706 | 10.5% |
| N | 1289979 | 6.9% |
| (space) | 984853 | 5.3% |
| e | 984853 | 5.3% |
| c | 984853 | 5.3% |
| t | 984853 | 5.3% |
| r | 984853 | 5.3% |
| a | 984853 | 5.3% |
| d | 984853 | 5.3% |
| Other values (26) | 5622103 | 30.0% |
Most occurring categories
| Value | Count | Frequency (%) |
| Lowercase Letter | 10833383 | 57.8% |
| Uppercase Letter | 6912082 | 36.9% |
| Space Separator | 984853 | 5.3% |
Most frequent character per category
Uppercase Letter
| Value | Count | Frequency (%) |
| N | 1289979 | 18.7% |
| I | 760150 | 11.0% |
| R | 644416 | 9.3% |
| G | 641863 | 9.3% |
| L | 534572 | 7.7% |
| O | 496052 | 7.2% |
| E | 441075 | 6.4% |
| B | 428581 | 6.2% |
| S | 426714 | 6.2% |
| A | 387843 | 5.6% |
| Other values (17) | 860837 | 12.5% |
Lowercase Letter
| Value | Count | Frequency (%) |
| o | 2954559 | 27.3% |
| n | 1969706 | 18.2% |
| e | 984853 | 9.1% |
| c | 984853 | 9.1% |
| t | 984853 | 9.1% |
| r | 984853 | 9.1% |
| a | 984853 | 9.1% |
| d | 984853 | 9.1% |
Space Separator
| Value | Count | Frequency (%) |
| (space) | 984853 | 100.0% |
Most occurring scripts
| Value | Count | Frequency (%) |
| Latin | 17745465 | 94.7% |
| Common | 984853 | 5.3% |
Most frequent character per script
Latin
| Value | Count | Frequency (%) |
| o | 2954559 | 16.6% |
| n | 1969706 | 11.1% |
| N | 1289979 | 7.3% |
| e | 984853 | 5.5% |
| c | 984853 | 5.5% |
| t | 984853 | 5.5% |
| r | 984853 | 5.5% |
| a | 984853 | 5.5% |
| d | 984853 | 5.5% |
| I | 760150 | 4.3% |
| Other values (25) | 4861953 | 27.4% |
Common
| Value | Count | Frequency (%) |
| (space) | 984853 | 100.0% |
Most occurring blocks
| Value | Count | Frequency (%) |
| ASCII | 18730203 | > 99.9% |
| None | 115 | < 0.1% |
Most frequent character per block
ASCII
| Value | Count | Frequency (%) |
| o | 2954559 | 15.8% |
| n | 1969706 | 10.5% |
| N | 1289979 | 6.9% |
| (space) | 984853 | 5.3% |
| e | 984853 | 5.3% |
| c | 984853 | 5.3% |
| t | 984853 | 5.3% |
| r | 984853 | 5.3% |
| a | 984853 | 5.3% |
| d | 984853 | 5.3% |
| Other values (24) | 5621988 | 30.0% |
None
| Value | Count | Frequency (%) |
| Ú | 77 | 67.0% |
| É | 38 | 33.0% |
cantidad
Real number (ℝ)
High correlation  Skewed 
| Distinct | 6230 |
|---|---|
| Distinct (%) | 0.3% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 38.054409 |
| Minimum | 0 |
|---|---|
| Maximum | 489689 |
| Zeros | 508 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1 |
| Q1 | 1 |
| median | 3.02 |
| Q3 | 12.96 |
| 95-th percentile | 149.31 |
| Maximum | 489689 |
| Range | 489689 |
| Interquartile range (IQR) | 11.96 |
Descriptive statistics
| Standard deviation | 746.40079 |
|---|---|
| Coefficient of variation (CV) | 19.614043 |
| Kurtosis | 201638.51 |
| Mean | 38.054409 |
| Median Absolute Deviation (MAD) | 2.02 |
| Skewness | 377.71593 |
| Sum | 79908017 |
| Variance | 557114.15 |
| Monotonicity | Not monotonic |
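Several of the figures above are derived from others: the range is maximum minus minimum, the IQR is Q3 minus Q1, the coefficient of variation is the standard deviation divided by the mean, and the variance is the squared standard deviation. The following sketch re-derives them from the values printed for cantidad as a sanity check. The tolerance is an assumption chosen to absorb rounding in the printed numbers.

```python
import math

# Values copied from the cantidad tables above.
mean, std = 38.054409, 746.40079
q1, q3 = 1.0, 12.96
minimum, maximum = 0.0, 489689.0

derived = {
    "range": maximum - minimum,
    "iqr": q3 - q1,
    "cv": std / mean,       # coefficient of variation
    "variance": std ** 2,
}
reported = {
    "range": 489689.0,
    "iqr": 11.96,
    "cv": 19.614043,
    "variance": 557114.15,
}

for name, value in derived.items():
    # The relative tolerance (an assumption) absorbs rounding in the report.
    ok = math.isclose(value, reported[name], rel_tol=1e-5)
    print(f"{name}: derived={value:.8g} reported={reported[name]:.8g} match={ok}")
```

A coefficient of variation near 19.6 together with a median of 3.02 and a maximum of 489689 is consistent with the Skewed alert: a handful of very large orders dominate the mean and the variance.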
| Value | Count | Frequency (%) |
| 1 | 589514 | 28.1% |
| 2 | 270560 | 12.9% |
| 5 | 73652 | 3.5% |
| 4 | 49233 | 2.3% |
| 25 | 47632 | 2.3% |
| 3 | 42377 | 2.0% |
| 10 | 35155 | 1.7% |
| 50 | 31807 | 1.5% |
| 6 | 22941 | 1.1% |
| 3.2 | 20553 | 1.0% |
| Other values (6220) | 916412 | 43.6% |
| Value | Count | Frequency (%) |
| 0 | 508 | < 0.1% |
| 0.15 | 1 | < 0.1% |
| 0.33 | 1 | < 0.1% |
| 0.4 | 30 | < 0.1% |
| 0.42 | 1 | < 0.1% |
| 0.48 | 74 | < 0.1% |
| 0.5 | 13 | < 0.1% |
| 0.52 | 7 | < 0.1% |
| 0.54 | 22 | < 0.1% |
| 0.57 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 489689 | 1 | < 0.1% |
| 477673 | 1 | < 0.1% |
| 264542 | 1 | < 0.1% |
| 262179 | 1 | < 0.1% |
| 251039 | 1 | < 0.1% |
| 246562 | 1 | < 0.1% |
| 189346 | 1 | < 0.1% |
| 167730 | 1 | < 0.1% |
| 162539 | 1 | < 0.1% |
| 130591 | 1 | < 0.1% |
precio
Real number (ℝ)
High correlation  Skewed 
| Distinct | 9634 |
|---|---|
| Distinct (%) | 0.5% |
| Missing | 531 |
| Missing (%) | < 0.1% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 9.2442102 |
| Minimum | 0 |
|---|---|
| Maximum | 12043.48 |
| Zeros | 398 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 0.12 |
| Q1 | 0.65 |
| median | 2.99 |
| Q3 | 6.27 |
| 95-th percentile | 43.33 |
| Maximum | 12043.48 |
| Range | 12043.48 |
| Interquartile range (IQR) | 5.62 |
Descriptive statistics
| Standard deviation | 29.530505 |
|---|---|
| Coefficient of variation (CV) | 3.1944866 |
| Kurtosis | 18669.961 |
| Mean | 9.2442102 |
| Median Absolute Deviation (MAD) | 2.39 |
| Skewness | 75.620122 |
| Sum | 19406417 |
| Variance | 872.05074 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.56 | 42332 | 2.0% |
| 0.5 | 36596 | 1.7% |
| 0.6 | 35823 | 1.7% |
| 0.47 | 33628 | 1.6% |
| 0.65 | 31075 | 1.5% |
| 0.07 | 30871 | 1.5% |
| 0.13 | 24975 | 1.2% |
| 0.08 | 22515 | 1.1% |
| 0.11 | 22032 | 1.0% |
| 0.15 | 21862 | 1.0% |
| Other values (9624) | 1797596 | 85.6% |
| Value | Count | Frequency (%) |
| 0 | 398 | < 0.1% |
| 0.01 | 2 | < 0.1% |
| 0.04 | 373 | < 0.1% |
| 0.05 | 60 | < 0.1% |
| 0.06 | 1545 | 0.1% |
| 0.07 | 30871 | 1.5% |
| 0.08 | 22515 | 1.1% |
| 0.09 | 11929 | 0.6% |
| 0.1 | 2643 | 0.1% |
| 0.11 | 22032 | 1.0% |
| Value | Count | Frequency (%) |
| 12043.48 | 1 | < 0.1% |
| 6451.52 | 1 | < 0.1% |
| 5833.75 | 1 | < 0.1% |
| 5592.58 | 1 | < 0.1% |
| 5591.54 | 1 | < 0.1% |
| 4955.1 | 1 | < 0.1% |
| 4816.04 | 1 | < 0.1% |
| 4099.78 | 1 | < 0.1% |
| 4044.55 | 1 | < 0.1% |
| 3846.93 | 2 | < 0.1% |
valor
Real number (ℝ)
High correlation  Skewed 
| Distinct | 46571 |
|---|---|
| Distinct (%) | 2.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 39.991693 |
| Minimum | 0 |
|---|---|
| Maximum | 56876.09 |
| Zeros | 872 |
| Zeros (%) | < 0.1% |
| Negative | 0 |
| Negative (%) | 0.0% |
| Memory size | 16.0 MiB |
Quantile statistics
| Minimum | 0 |
|---|---|
| 5-th percentile | 1.17 |
| Q1 | 4.69 |
| median | 13.32 |
| Q3 | 37.13 |
| 95-th percentile | 146.3 |
| Maximum | 56876.09 |
| Range | 56876.09 |
| Interquartile range (IQR) | 32.44 |
Descriptive statistics
| Standard deviation | 165.07401 |
|---|---|
| Coefficient of variation (CV) | 4.1277075 |
| Kurtosis | 18659.162 |
| Mean | 39.991693 |
| Median Absolute Deviation (MAD) | 10.83 |
| Skewness | 86.352678 |
| Sum | 83975996 |
| Variance | 27249.429 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 1.13 | 33436 | 1.6% |
| 1.2 | 29827 | 1.4% |
| 2.34 | 20694 | 1.0% |
| 1.3 | 19555 | 0.9% |
| 2.49 | 15704 | 0.7% |
| 1.17 | 13754 | 0.7% |
| 0.35 | 10842 | 0.5% |
| 2.5 | 9731 | 0.5% |
| 2.67 | 9279 | 0.4% |
| 4.69 | 8032 | 0.4% |
| Other values (46561) | 1928982 | 91.9% |
| Value | Count | Frequency (%) |
| 0 | 872 | < 0.1% |
| 0.07 | 32 | < 0.1% |
| 0.08 | 2 | < 0.1% |
| 0.09 | 22 | < 0.1% |
| 0.1 | 36 | < 0.1% |
| 0.11 | 19 | < 0.1% |
| 0.12 | 40 | < 0.1% |
| 0.13 | 223 | < 0.1% |
| 0.14 | 32 | < 0.1% |
| 0.15 | 83 | < 0.1% |
| Value | Count | Frequency (%) |
| 56876.09 | 1 | < 0.1% |
| 55491.7 | 1 | < 0.1% |
| 32647.37 | 1 | < 0.1% |
| 32398.94 | 1 | < 0.1% |
| 30571.52 | 1 | < 0.1% |
| 30508.79 | 1 | < 0.1% |
| 30434.9 | 1 | < 0.1% |
| 29158.44 | 1 | < 0.1% |
| 28599.27 | 1 | < 0.1% |
| 24053.41 | 1 | < 0.1% |
alineación con portafolio estratégico
Real number (ℝ)
High correlation  Skewed 
| Distinct | 24169 |
|---|---|
| Distinct (%) | 1.2% |
| Missing | 0 |
| Missing (%) | 0.0% |
| Infinite | 0 |
| Infinite (%) | 0.0% |
| Mean | 3.9451203 |
| Minimum | -24187.462 |
|---|---|
| Maximum | 4162.2267 |
| Zeros | 2045 |
| Zeros (%) | 0.1% |
| Negative | 1438 |
| Negative (%) | 0.1% |
| Memory size | 16.0 MiB |
Quantile statistics
| Minimum | -24187.462 |
|---|---|
| 5-th percentile | 0.12096 |
| Q1 | 0.473472 |
| median | 1.410048 |
| Q3 | 3.725568 |
| 95-th percentile | 14.263776 |
| Maximum | 4162.2267 |
| Range | 28349.689 |
| Interquartile range (IQR) | 3.252096 |
Descriptive statistics
| Standard deviation | 21.921295 |
|---|---|
| Coefficient of variation (CV) | 5.5565594 |
| Kurtosis | 708193.35 |
| Mean | 3.9451203 |
| Median Absolute Deviation (MAD) | 1.150848 |
| Skewness | -623.44251 |
| Sum | 8284105.7 |
| Variance | 480.5432 |
| Monotonicity | Not monotonic |
| Value | Count | Frequency (%) |
| 0.124416 | 33990 | 1.6% |
| 0.117504 | 33368 | 1.6% |
| 0.2592 | 24607 | 1.2% |
| 0.134784 | 22377 | 1.1% |
| 0.245376 | 18426 | 0.9% |
| 0.048384 | 15546 | 0.7% |
| 0.12096 | 13252 | 0.6% |
| 0.487296 | 11126 | 0.5% |
| 0.27648 | 10525 | 0.5% |
| 0.342144 | 8889 | 0.4% |
| Other values (24159) | 1907730 | 90.9% |
| Value | Count | Frequency (%) |
| -24187.46227 | 1 | < 0.1% |
| -1712.174976 | 1 | < 0.1% |
| -796.189824 | 1 | < 0.1% |
| -701.059968 | 1 | < 0.1% |
| -684.160128 | 1 | < 0.1% |
| -587.48544 | 1 | < 0.1% |
| -497.726208 | 1 | < 0.1% |
| -455.728896 | 1 | < 0.1% |
| -237.102336 | 1 | < 0.1% |
| -215.046144 | 1 | < 0.1% |
| Value | Count | Frequency (%) |
| 4162.226688 | 1 | < 0.1% |
| 3887.008128 | 1 | < 0.1% |
| 2969.6544 | 1 | < 0.1% |
| 2900.669184 | 1 | < 0.1% |
| 2229.645312 | 1 | < 0.1% |
| 2158.852608 | 1 | < 0.1% |
| 2016.144 | 1 | < 0.1% |
| 1932.795648 | 1 | < 0.1% |
| 1932.436224 | 1 | < 0.1% |
| 1788.9984 | 1 | < 0.1% |
Interactions
Correlations
| | alineación con portafolio estratégico | cantidad | categoria | categoria_macro | cluster | edad | id | pedido | precio | valor | zona |
|---|---|---|---|---|---|---|---|---|---|---|---|
| alineación con portafolio estratégico | 1.000 | 0.369 | 0.026 | 0.016 | 0.045 | -0.067 | -0.058 | -0.007 | 0.494 | 0.982 | 0.004 |
| cantidad | 0.369 | 1.000 | 0.038 | 0.006 | 0.066 | -0.042 | -0.058 | -0.011 | -0.530 | 0.390 | 0.006 |
| categoria | 0.026 | 0.038 | 1.000 | 1.000 | 0.091 | 0.039 | 0.024 | 0.015 | 0.062 | 0.018 | 0.032 |
| categoria_macro | 0.016 | 0.006 | 1.000 | 1.000 | 0.093 | 0.025 | 0.020 | 0.017 | 0.046 | 0.005 | 0.059 |
| cluster | 0.045 | 0.066 | 0.091 | 0.093 | 1.000 | 0.057 | 0.042 | 0.016 | 0.026 | 0.056 | 0.244 |
| edad | -0.067 | -0.042 | 0.039 | 0.025 | 0.057 | 1.000 | 0.061 | -0.001 | -0.028 | -0.069 | 0.085 |
| id | -0.058 | -0.058 | 0.024 | 0.020 | 0.042 | 0.061 | 1.000 | 0.523 | -0.007 | -0.060 | 0.037 |
| pedido | -0.007 | -0.011 | 0.015 | 0.017 | 0.016 | -0.001 | 0.523 | 1.000 | 0.005 | -0.006 | 0.014 |
| precio | 0.494 | -0.530 | 0.062 | 0.046 | 0.026 | -0.028 | -0.007 | 0.005 | 1.000 | 0.484 | 0.000 |
| valor | 0.982 | 0.390 | 0.018 | 0.005 | 0.056 | -0.069 | -0.060 | -0.006 | 0.484 | 1.000 | 0.027 |
| zona | 0.004 | 0.006 | 0.032 | 0.059 | 0.244 | 0.085 | 0.037 | 0.014 | 0.000 | 0.027 | 1.000 |
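The matrix above can be scanned for strongly related pairs. The sketch below copies the coefficients from the table and lists every pair whose absolute value reaches a cut-off. The 0.5 cut-off is an assumption (it is not stated in this report), although it does reproduce the four pairs flagged as High correlation in the Overview. The helper name `strong_pairs` is illustrative.

```python
names = [
    "alineación con portafolio estratégico", "cantidad", "categoria",
    "categoria_macro", "cluster", "edad", "id", "pedido", "precio",
    "valor", "zona",
]

# Coefficients copied row by row from the correlation table above.
matrix = [
    [1.000, 0.369, 0.026, 0.016, 0.045, -0.067, -0.058, -0.007, 0.494, 0.982, 0.004],
    [0.369, 1.000, 0.038, 0.006, 0.066, -0.042, -0.058, -0.011, -0.530, 0.390, 0.006],
    [0.026, 0.038, 1.000, 1.000, 0.091, 0.039, 0.024, 0.015, 0.062, 0.018, 0.032],
    [0.016, 0.006, 1.000, 1.000, 0.093, 0.025, 0.020, 0.017, 0.046, 0.005, 0.059],
    [0.045, 0.066, 0.091, 0.093, 1.000, 0.057, 0.042, 0.016, 0.026, 0.056, 0.244],
    [-0.067, -0.042, 0.039, 0.025, 0.057, 1.000, 0.061, -0.001, -0.028, -0.069, 0.085],
    [-0.058, -0.058, 0.024, 0.020, 0.042, 0.061, 1.000, 0.523, -0.007, -0.060, 0.037],
    [-0.007, -0.011, 0.015, 0.017, 0.016, -0.001, 0.523, 1.000, 0.005, -0.006, 0.014],
    [0.494, -0.530, 0.062, 0.046, 0.026, -0.028, -0.007, 0.005, 1.000, 0.484, 0.000],
    [0.982, 0.390, 0.018, 0.005, 0.056, -0.069, -0.060, -0.006, 0.484, 1.000, 0.027],
    [0.004, 0.006, 0.032, 0.059, 0.244, 0.085, 0.037, 0.014, 0.000, 0.027, 1.000],
]

THRESHOLD = 0.5  # assumed cut-off, not taken from the report


def strong_pairs(names, matrix, threshold):
    """Return (name_a, name_b, coefficient) for each pair above the cut-off."""
    pairs = []
    for i in range(len(names)):
        for j in range(i + 1, len(names)):  # upper triangle only, skip diagonal
            if abs(matrix[i][j]) >= threshold:
                pairs.append((names[i], names[j], matrix[i][j]))
    return pairs


for a, b, r in strong_pairs(names, matrix, THRESHOLD):
    print(f"{a} ~ {b}: {r:+.3f}")
```

Note that precio against valor (0.484) and precio against alineación con portafolio estratégico (0.494) fall just under this assumed cut-off, and that the only strong negative relationship is cantidad against precio (-0.530).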
Missing values
Sample
| | fecha | pedido | id | edad | municipio | zona | asesor | punto de venta | cluster | categoria_macro | categoria | subcategoria | producto | color | cantidad | precio | valor | alineación con portafolio estratégico |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1971-04-30 | 2 | 2 | 52 | EL CARMEN DE CHUCURI | SANTANDER | asesor_2 | punto_venta_2 | cluster_tienda_2 | categoria_macro_1 | categoria_2 | subcategoria_2 | producto_2 | GRIS | 1.00 | 32.88 | 32.88 | 2.920320 |
| 1 | 1971-04-30 | 3 | 3 | 31 | VILLANUEVA | LA GUAJIRA | asesor_3 | punto_venta_2 | cluster_tienda_2 | categoria_macro_2 | categoria_3 | subcategoria_3 | producto_3 | BEIGE | 2.00 | 0.56 | 1.13 | 0.117504 |
| 2 | 1971-04-30 | 4 | 4 | 43 | VILLANUEVA | LA GUAJIRA | asesor_4 | punto_venta_2 | cluster_tienda_2 | categoria_macro_3 | categoria_4 | subcategoria_4 | producto_4 | No encontrado | 1.00 | 8.38 | 8.38 | 1.251072 |
| 3 | 1971-04-30 | 5 | 5 | 31 | VILLANUEVA | LA GUAJIRA | asesor_5 | punto_venta_3 | cluster_tienda_3 | categoria_macro_2 | categoria_5 | subcategoria_5 | producto_5 | BLANCO | 21.14 | 2.27 | 47.99 | 3.729024 |
| 4 | 1971-04-30 | 6 | 6 | 49 | ARROYOHONDO | BOLÍVAR | asesor_6 | punto_venta_4 | cluster_tienda_2 | categoria_macro_4 | categoria_6 | subcategoria_6 | producto_6 | No encontrado | 1.00 | 9.96 | 9.96 | 1.223424 |
| 5 | 1971-04-30 | 6 | 6 | 49 | ARROYOHONDO | BOLÍVAR | asesor_6 | punto_venta_4 | cluster_tienda_2 | categoria_macro_2 | categoria_5 | subcategoria_5 | producto_7 | BRILLANTE | 1.89 | 2.79 | 5.28 | 0.397440 |
| 6 | 1971-04-30 | 7 | 7 | 50 | VILLANUEVA | LA GUAJIRA | asesor_7 | punto_venta_2 | cluster_tienda_2 | categoria_macro_2 | categoria_5 | subcategoria_5 | producto_8 | BLANCO | 17.01 | 2.93 | 49.89 | 4.434048 |
| 7 | 1971-04-30 | 7 | 7 | 50 | VILLANUEVA | LA GUAJIRA | asesor_7 | punto_venta_2 | cluster_tienda_2 | categoria_macro_2 | categoria_7 | subcategoria_5 | producto_9 | GRIS | 4.56 | 3.18 | 14.51 | 1.289088 |
| 8 | 1971-04-30 | 8 | 8 | 57 | CABRERA | CUNDINAMARCA | asesor_8 | punto_venta_5 | cluster_tienda_2 | categoria_macro_1 | categoria_1 | subcategoria_7 | producto_10 | BLANCO | 1.00 | 11.50 | 11.50 | 1.195776 |
| 9 | 1971-04-30 | 9 | 9 | 50 | VILLANUEVA | LA GUAJIRA | asesor_2 | punto_venta_2 | cluster_tienda_2 | categoria_macro_2 | categoria_8 | subcategoria_8 | producto_11 | MULTICOLOR | 2.00 | 3.06 | 6.11 | 0.857088 |
| | fecha | pedido | id | edad | municipio | zona | asesor | punto de venta | cluster | categoria_macro | categoria | subcategoria | producto | color | cantidad | precio | valor | alineación con portafolio estratégico |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 2099826 | 1972-09-01 | 933930 | 419221 | 29 | CURITI | SANTANDER | asesor_380 | punto_venta_10 | cluster_tienda_3 | categoria_macro_1 | categoria_1 | subcategoria_26 | producto_410 | No encontrado | 1.00 | 0.92 | 0.92 | 0.082944 |
| 2099827 | 1972-09-01 | 933931 | 419224 | 28 | CASTILLA LA NUEVA | META | asesor_45 | punto_venta_16 | cluster_tienda_3 | categoria_macro_4 | categoria_9 | subcategoria_25 | producto_267 | No encontrado | 1.00 | 15.90 | 15.90 | 2.374272 |
| 2099828 | 1972-09-01 | 933932 | 419225 | 46 | CURITI | SANTANDER | asesor_45 | punto_venta_16 | cluster_tienda_3 | categoria_macro_4 | categoria_9 | subcategoria_25 | producto_2690 | No encontrado | 1.00 | 20.46 | 20.46 | 3.335040 |
| 2099829 | 1972-09-01 | 933933 | 368060 | 57 | CURITI | SANTANDER | asesor_219 | punto_venta_15 | cluster_tienda_2 | categoria_macro_2 | categoria_5 | subcategoria_5 | producto_3200 | MULTICOLOR | 21.06 | 3.67 | 77.30 | 8.035200 |
| 2099830 | 1972-09-01 | 933933 | 368060 | 57 | CURITI | SANTANDER | asesor_219 | punto_venta_15 | cluster_tienda_2 | categoria_macro_2 | categoria_5 | subcategoria_5 | producto_699 | GRIS | 17.01 | 3.04 | 51.72 | 5.377536 |
| 2099831 | 1972-09-01 | 933933 | 368060 | 57 | CURITI | SANTANDER | asesor_219 | punto_venta_15 | cluster_tienda_2 | categoria_macro_2 | categoria_7 | subcategoria_5 | producto_3328 | MULTICOLOR | 45.50 | 2.73 | 124.03 | 11.021184 |
| 2099832 | 1972-09-01 | 933934 | 78489 | 29 | CASTILLA LA NUEVA | META | asesor_45 | punto_venta_16 | cluster_tienda_3 | categoria_macro_2 | categoria_5 | subcategoria_5 | producto_4719 | BLANCO | 8.64 | 3.56 | 30.78 | 2.865024 |
| 2099833 | 1972-09-01 | 933935 | 415279 | 42 | CASTILLA LA NUEVA | META | asesor_45 | punto_venta_16 | cluster_tienda_3 | categoria_macro_4 | categoria_10 | subcategoria_37 | producto_1414 | No encontrado | 1.00 | 33.04 | 33.04 | 3.753216 |
| 2099834 | 1972-09-01 | 933936 | 419226 | 47 | NATAGAIMA | TOLIMA | asesor_45 | punto_venta_16 | cluster_tienda_3 | categoria_macro_2 | categoria_5 | subcategoria_5 | producto_511 | MARFIL | 11.52 | 3.75 | 43.14 | 4.485888 |
| 2099835 | 1972-09-01 | 933936 | 419226 | 47 | NATAGAIMA | TOLIMA | asesor_45 | punto_venta_16 | cluster_tienda_3 | categoria_macro_2 | categoria_7 | subcategoria_5 | producto_248 | No encontrado | 1.60 | 3.18 | 5.09 | 0.452736 |